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Title of the Invention 
Automated Database Assistance via Telephone 

Field of the Invention 
The present invention relates to automated database assistance, 
particularly via telephone and more particularly, but not exclusively, to an 
automated directory assistance service. 

Background of the Invention 

Directory assistance, and more generally, the interrogation of a database 
via a telephony device, is currently achieved by using telephone operators to 
manually input data received in voice form from a user for interrogation of the 
database. The use of manual operators is generally preferred because they are 
more adept at interpreting voice input from enquirers than is voice recognition 
software. For example, the voice input may be made in a variety of accents 
and may be made over better or worse connections, and under these 
circumstances manual operators are generally preferred. 

Furthermore, a manual operator is able to judge whether or not a word 
has been heard correctly so as to decide whether to ask a user to repeat the 
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word or to carry out a search on the basis of the presumed input. The manual 
operator is also able to start the search and then to ask the user to give further 
information until the search is completed, or to decide that the search cannot be 
completed. Automatic methods of obtaining voice data lack the flexibility for 
making effective decisions in obtaining data and using the data in the search. 

The use of human operators is the largest cost element attendant in 
running a telephony-based database enquiry service. Generally, the call 
volume varies with the time of day, with notable peak times. The use of human 
operators therefore requires considerable scheduling in order to meet peak 
demand. 

Generally, a basic directory enquiry service is mandated by a 
telecommunications regulator, and value-added variations or improvements on 
the basic service may be provided as premium level services by the 
telecommunication service operator or by others. 

Two particular known systems that address the above-mentioned 
problems include: (1) a rerouting system that routes calls to other call centers at 
peak times, and (2) an automation system that uses an automatic dialogue to 
obtain information from the caller. In the automation system, the recorded 
information is played back to an operator who may immediately initiate the 
search. 

A further approach is to be described in US Patent No. 5,638,425, 
Meador et al, entitled "Automated Directory Assistance System Using Word 
Recognition and Phoneme Processing Method", the contents of which are 

2 



herein incorporated by reference. The patent discloses a mechanized directory 
assistance system for use in a telecommunications network, which includes 
multiple speech recognition devices comprising a word recognition device, a 
phoneme recognition device, and an alphabet recognition device. The system 
also includes a voice processing unit, and a computer operating under stored 
program control. A database is utilized which may comprise the same database 
as used for operator directory assistance. The directory assistance system 
operates as follows: A directory assistance caller is prompted to speak the city 
or location desired. The response is digitized and simultaneously inputted to the 
word and phoneme recognition devices, each of which outputs a translation 
signal plus a probability level signal. The probability level signals are 
compared and the highest probability level translation is selected. The caller is 
then prompted to speak the name of the sought party. The response is processed 
in the same manner as the location word. In the event that the probability level 
fails to meet a predetermined standard, the caller is prompted to spell all or part 
of the location and/or name. The resulting signal is inputted to the alphabet 
device. When translations are obtained having a satisfactory probability level 
the database is accessed. If plural listings are located from the database, then 
these listings are articulated and the caller is prompted to respond affirmatively 
or negatively to each listing. When a single directory number has been located, 
a signal is transmitted to the caller to articulate this number. The system also 
allows DTMF keyboard input in connection with the spelling procedure. 
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Summary of the Invention 

Embodiments of the present invention provide a way to automatically 
search databases remotely using remote contact devices, irrespective of 
whether such remote contact devices are designed for text based 
communication or speech-based communication. 

According to a first aspect of the present invention there is thus provided 
an interface for remote user input for reading a database, the interface 
comprising 

an automatic question unit operable to determine whether a user is 
connected via a voice-based or a text-based or a combined voice-text capable 
communication link and for eliciting input from a user via either one of said 
voice-and text based communication in accordance with said determination, 
a speech recognition unit for recognizing human speech input, 
a data recognition unit for recognizing remote data input, and 
a query formulation unit operable both for formulating a searchable 
query from said recognized input and for prompting said automatic question 
unit to elicit further input from a user, 

and wherein the interface is associated with a database to search said 
database using said recognized input. 

Preferably, the said speech recognition unit comprises a speech to text 
converter operable to convert said user input into query material for said 
database, and wherein said database comprises text entries. 
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Preferably, the speech recognition unit comprises a speech to phoneme 
converter operable to convert said user input into query material for said 
database, and wherein said database comprises entries made up of groups of 
one or more phonemes. 

Preferably, the speech recognition unit comprises a combined speech to 
text converter and speech to phoneme converter, operable to convert said user 
input into query material for said database. 

A preferred embodiment further comprises a confidence level 
determiner, associated with said speech recognition unit, said determiner being 
operable to determine a level of confidence from a plurality of possible 
confidence levels for a recognition instance of said speech recognition unit. 

A preferred embodiment comprises an output unit for outputting a 
search result, wherein said output unit comprises speech and data output 
abilities and a selector for selecting an output ability to be used based on a user 
data receipt ability. 

A preferred embodiment is interfaceable to a mobile telephone data 
facility. 

Preferably, the mobile telephone data facility is any one of a group 
comprising WEB, WAP, plain text and SMS. 

A preferred embodiment is interfaceable to a messaging service. 

Preferably, the query formulation unit is operable to submit a recognized 
speech input as a query for search to said database and, in the event of failure 



to match, is further operable to prompt said question unit to ask a user to spell 
said input. 

A preferred embodiment comprises associative linkage between 
associated names for widening searches on the basis of variations of input 
names. 

Preferably, the database is a contact directory giving at least one contact 
point for each one of a plurality of searchable entries. 

Preferably, for any searchable entry having more than one contact point, 
a hierarchy of contact point types is provided to define which of said contact 
points to output first. 

Preferably, a contact point is usable as an input to obtain a searchable 

entry. 

Preferably, the automatic voice question unit is programmable with a 
plurality of questions, and a number of said questions is determinable 
according to a size of the database. 

Preferably, the questions are storable in a hierarchy which corresponds 
to a predetermined search strategy for the database, and wherein said automatic 
voice question unit is operable to stop asking questions as soon as sufficient 
information has been obtained to terminate a database search. 

A preferred embodiment is operable to pass a user to a human operator 
when said hierarchy of questions is ended and a database search has not been 
terminated. 
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A preferred embodiment is operable to pass a user to a human operator 
when a user input is not translatable into material usable for searching said 
database. 

A preferred embodiment further comprises a confidence level 
determiner, associated with said speech recognition unit, and operable to 
determine a level of confidence for a recognition instance of said speech 
recognition unit, said confidence level determiner being further operable to 
pass a user to a human operator when a user input is associated with a 
confidence level lower than a predetermined confidence threshold. 

A preferred embodiment comprises a switch for connecting a user to a 
contact point retrieved from said database. 

A preferred embodiment comprises a data exchange mechanism 
operable to bring about data interactivity between said database and a remotely 
located user database. 

A preferred embodiment is operable to insert an identification of a caller 
into a header of a message left by said caller. 

Preferably, the contact point is a telephone number. 

Preferably, the identification is any one of a group comprising a text 
string, a photograph an audio sequence and a video sequence. 

Preferably, the database is searchable to retrieve a location and wherein 
said retrieved location is superimposable on one of a group comprising a map, 
a video and a photograph. 
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A preferred embodiment includes a graphical output unit operable to 
send said map to said user. 

Preferably said map is in a location system operable to determine a 
current location of a user and which location system is further operable to trace 
a route from said current location to said retrieved location. 

Preferably, the database comprises results fields including any one of a 
group comprising a text string field, a photograph field and a video sequence 
field. 

Preferably, the question unit comprises a speech output operable to 
output questions in spoken form to users connected via speech-enabled devices 
and a text output to output questions in text form to users connected via text- 
enabled devices. 

According to a second aspect of the present invention there is provided a 
location system connectable to a location database comprising geographic 
location data associated with personal identification data such that said 
personal identification data is usable in search queries to obtain an associated 
location, which location system comprises a positioner for determining a 
current position of an enquirer, and which is operable to receive said location 
data from said location database in response to a query involving said personal 
identification data, which location system further comprises a route determiner 
for determining a route from said current position to said desired location using 
said location data. 
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Preferably, the location database is a directory associating subscriber 
identification data with subscriber address data. 

A preferred embodiment has a graphical output operable to output said 
route as a route on a map. 

Preferably, the output is operable to output said route in real time. 

A preferred embodiment has a combined voice and text output operable 
to determine whether a user is connected via voice capable or text capable 
communication and operable to output said route as a sequence of instructions 
in text and voice format in accordance with said determination. 

Preferably, the combined voice and text output is operable to output said 
sequence of instructions in a preselected language. 

Preferably, the sequence in said preselected language is obtainable from 
a corresponding sequence in a base language by real time automatic translation. 

Preferably, the location data comprises map co-ordinates. 

Preferably, the personal identification data comprises street address data 
and wherein said positioner is operable to translate street address data into 
corresponding map co-ordinates. 

According to a third aspect of the present invention there is provided an 
interfacing method for remote user input for reading a database, the method 
comprising 

determining, as a connection type, whether a user is connected via a 
voice-based or a text-based or a combined voice-text capable communication 
link, 



eliciting input from a user via either one of voice- and text based 

communication according to said connection type, 

recognizing human speech input to said interface, 

recognizing remote data input to said interface, 

formulating a searchable query from said recognized input, 

prompting said user via automatic questioning to elicit further input 

from a user unless a query sufficient for searching said database has been 

formulated, 

and searching a database using said query. 

A preferred embodiment comprises determining whether an ambiguous 
answer is received from said database, and 

if an ambiguous answer is received then prompting said automatic 
question unit to elicit further input from a user until a more sufficient query has 
been formulated. 

According to a fourth aspect of the present invention there is provided a 
method of remotely reading a database via a remote communication device 
having a communication mode, comprising: 

entering a query request via said remote communication device in said 
communication mode, 

sending said query request to a communication interface in said 
communication mode, 

receiving instructions in said communication mode for entering query 
items to form a database search query, 
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sending said query items in said communication mode for said interface 
to form a query for interrogating said database in a database interrogation mode 
to produce a query result in said database interrogation mode for translation by 
said interface into said communication mode, and 

receiving said result at said remote communication device from said 
interface in said communication mode. 

Preferably, the communication mode is a mode of voice communication. 

Alternatively, the communication mode may be a mode of text 
communication. 

The database interrogation mode may be a mode of text communication, 
or a mode of communicating using phonemes. 

Brief Description of the Drawings 
For a better understanding of the invention and to show how the same 
may be carried into effect, reference will now be made, purely by way of 
example, to the accompanying drawings, in which: 

Fig. 1 is a block diagram showing an automatic interface between a 
remote user and a database, which interface is operable to search a database 
using queries formulated on the basis of information gathered over a telephony 
link from a remote user; 

Fig. 2 is a flow chart showing operation of the interface of Fig. 1 ; 

Fig. 3 is a flow chart showing in greater detail the selection of of a 
further question in the operation of Fig. 2; 
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Fig. 4 is a flow chart showing in greater detail the selection of a farther 
question in the event of low probability recognition of user input; and 

Fig. 5 is a block diagram of an architecture for supporting automatic 
directory assistance, in accordance with a preferred embodiment of the present 
invention. 

Description of the Preferred Embodiment 
Before explaining at least one embodiment of the invention in detail, it 
is to be understood that the invention is not limited in its application to the 
details of construction and the arrangement of the components set forth in the 
following description or illustrated in the drawings. The invention is applicable 
to other embodiments, and is able to be practiced and carried out in various 
ways. Also, it is to be understood that the phraseology and terminology 
employed herein is for the purpose of description and should not be regarded as 
limiting. 

Reference is now made to Fig. 1, which is a block diagram showing an 
automatic interface between a remote user and a database, which interface is 
operable to search a database using queries formulated on the basis of 
information gathered over a telephony link from a remote user. Thus a remote 
user is enabled to use whatever facilities are available or most convenient on 
his telephony device, to obtain data from the database. 

An interface 10 receives input from remotely located telephony devices 
12 and 14 via the public switched telephone network (PSTN) 15. Data 
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received from the remote telephony devices is most likely to be in voice form 
but may also be in the form of data input via a keyboard. Such data may 
typically be transferred over the PSTN 15 as DTMF tones but possibly also as 
digital data, depending on the communication device. If the Internet, or like 
data based communication network is used then the data is preferably 
transmitted as is. 

The data input from the user is intended to elicit information and thus 
preferably comprises query material which may be made to correspond to one 
or more searchable fields of an associated database. To obtain user input in a 
way that can be inserted into a database, the interface preferably comprises an 
automatic question unit 1 8 for asking questions of the user with the aim of 
obtaining the data in a structured manner. The interface is able to determine 
whether the user is communicating via voice or text, preferably simply by 
analyzing the initial input of the user initiating the search procedure, 
alternatively by analyzing a handshake procedure used to set up the channel. 
The skilled person will be aware of other ways of carrying out such a 
determination. If the user is using voice communication then the question unit 
18 preferably communicates using voice. The questions may thus be 
prerecorded or they may be formulated afresh and spoken by a speech synthesis 
unit. If the user is using text based communication then text based versions of 
the questions may simply be sent to the user as text messages. 

In many telephony devices, voice is the only way in which data can be 
received. Relatively few devices for example have a screen for displaying data. 
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On the other hand, all telephony devices have a keyboard of some kind and 
keyboards can be used to provide textual input. Nevertheless the telephone 
keyboard is rarely a full keyboard and generally speech is the easiest way to 
send information from such a device. Thus the interface as illustrated has both 
a data recognition unit 20 and a speech recognition unit 22 so that it is able to 
recognize any kind of input that is likely to be sent from a telephony- enabled 
device. 

Returning to the automatic question unit and selection of questions, and 
as mentioned above, the questions are intended to obtain specific responses 
from a user in such a way that they can be related to individual searchable 
fields in the database. The questions are preferably hierarchically arranged in 
terms of more important and less important fields so that more important fields 
are filled first. Thus, if the important fields are successfully filled, remaining 
less important fields can be ignored, being used only if an initial search based 
on the important fields is unsuccessful. Such a hierarchical arrangement 
reduces the total number of questions asked and leads to more efficient 
searches. 

Where the input is textual, the data recognition unit 20 is initially 
informed by the Interface 10 that textual information is being received and then 
interprets the received information in relation to the individual fields in the 
database 16. The data recognition unit 20 is preferably capable of recognizing 
DTMF tones. In general, speech and data connections can be distinguished by 
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the interface inter alia on the basis of a connection handshake procedure used 
to set up the connection, as mentioned above. 

The speech recognition unit 22 is preferably informed by the Interface 
10 that incoming data is in the form of speech, and then interprets the speech in 
terms of the searchable fields in the database 16, as will be explained in more 
detail below. As mentioned above, the interface may inter alia determine the 
presence of speech data from a handshake or call setup procedure, which is 
generally carried out differently for setting up voice or data links. 

The interface 10 preferably further comprises a query formulator 24 
which arranges the information identified by the recognition units 20 and 22 in 
terms of database fields in order to form queries for interrogating the database. 
The query formulator takes information received from the enquirer and 
arranges it fieldwise to form a query. For example it may take data received in 
response to a question about the surname and data received in response to a 
question about an initial and may form them into a query of the form 'surname 
+ initial', which query can be applied to a search engine for interrogating the 
database. 

The query may be submitted once a threshold number of fields have 
been filled in. In a preferred embodiment the threshold number of fields is a 
function of the size of the database. In a further preferred embodiment the 
threshold may be a function, not just of the number of fields filled in but also 
the degree of certainty within those fields. That is to say, since data received 
from speech is not necessarily unambiguously recognized, a strength level of 
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the query may be adjudged from the level of certainty in the fields as well as 
the number of fields filled in. Only queries whose strength level exceeds the 
threshold are submitted to the database. 

If the query leads to an unambiguous query result from the database then 
the unambiguous result is output from the database 16 to an output unit 28 
which sends the data to the remote enquirer. Preferably, if the result is not 
ambiguous, the user is asked further questions by the voice question unit 18 so 
as to be able to use further searchable fields of the database in an improved 
query. To this end the automatic voice question unit 18 is programmable with 
a plurality of questions arranged as a hierarchy, meaning that the questions are 
arranged for making optimal use of the database structure. Thus, a field that is 
always crucial in a search is requested early on. A field that is mostly useful in 
resolving between already retrieved results is asked later on. Again, fields may 
also be arranged in accordance with the way in which the enquirer is likely to 
relate to the query. For example a user is likely to input a first name before a 
family name even though the family name is generally the more important 
search field. 

In a preferred embodiment the size of the database is taken into account 
in deciding how many fields are requested from the user before initial 
submission of the query. For example a small database having just a few 
thousand names would probably be able to process the vast majority of queries 
on the basis of a family name alone. Thus a number of questions to be asked 
initially is determined according to the size of the database and according to the 
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quantity of information likely to be needed to resolve the average query. This 
is particularly helpful since users are not likely to be comfortable using a 
system that asks them large numbers of questions to obtain relatively little 
information. 

Preferably, if a predetermined hierarchy of questions is exhausted 
without a search being satisfactorily completed, or if information obtained from 
the user cannot be translated into data usable for the search, or if the user 
requires it specifically, then the user is transferred to a human operator. 

In one embodiment, the speech recognition unit 22 comprises a speech 
to text converter. Correspondingly, the database 16 comprises text entries and 
is searchable using the output of the speech to text converter when processed 
by the query formulator. The speech to text converter is known in the prior art, 
and in particular is discussed in US Patent Application No. 5,638,425 referred 
to above. 

In another embodiment, the speech recognition unit 22 comprises a 
speech to phoneme converter. Correspondingly, the query is made up from the 
resulting phonemes, the database 16 comprises entries made up of groups of 
one or more phonemes, and database searching involves the matching of 
phonemes to arrive at a search result giving a required database entry. The 
speech to phoneme converter is known in the prior art, and in particular is 
discussed in US Patent Application No. 5,638,425 referred to above. 

Speaker independent speech recognition hardware that can be used in 
the above-mentioned embodiments is available as a range of processor chips 

17 



from Sensory, Inc. 1991 Russell AvenueSanta Clara, CA 95054-2035, USA. 
Text regognition hardware is also available from the same source. 

In a particularly preferred embodiment, as illustrated in Fig. 1, the 
speech recognition unit 22 includes both a speech to text converter 23a and 
speech to phoneme converter 23b. The database may correspondingly comprise 
searchable fields stored in both text and phoneme format. In one possible 
mode of operation the interface 10 may select for the query whichever of the 
text or phoneme output has been assigned a higher level of confidence by the 
confidence assignor 26 for use in the search. In another mode of operation the 
interface may use both text and phoneme based queries in the search. 

Preferably, the interface comprises a confidence assignor 26. The 
assigner is preferably associated with the speech to text converter or speech to 
phoneme converter referred to above. In general, when converting speech, 
pattern matching or like techniques are used to match the incoming sound to 
templates and the template giving the closest match is recognized as 
corresponding to the input. However, absolute measures of the accuracy of the 
match may be available from the matching process and these can be used to 
associate any given result with a confidence level. For example, a cross- 
correlation may be used to determine a match between an input waveform and 
each of a series of templates. The template giving the highest cross-correlation 
is selected and the level of the cross-correlation function may then be used to 
provide a level of confidence to the match. The level of confidence can then be 



used by the interface 10, for example to make decisions about repeating 
questions or referring the user to a human operator. 

Thus, preferably, the interface is operable to submit an input obtained in 
the form of speech to formulate a query for searching the database 16. In the 
event of a failure to produce an unambiguous result from the database 16, or in 
the event of failure to match text or phoneme templates to the speech input or 
in the event of a match of text or phoneme templates but at a low level of 
confidence, the voice question unit is further operable to ask a user to spell the 
input. In the case of spelling an input, then at least in the case of the English 
keyboard, there are only 26 possible results that have to be considered, and if 
taking into account the full alphanumeric character set then there are 36 
possible results. These 26 or 36 results generally have relatively large 
distances between them, and so the possibility for error or confusion is reduced. 
The skilled person will appreciate that the number of results will be different 
for non-English language character sets. 

The interface preferably comprises output unit 28 for outputting a search 
result. Preferably, the output unit comprises speech and text output capabilities 
and a selector for selecting an output capability to be used based on a user data 
receipt ability, preferably determined by the interface 10. In general, set up 
sequences for communication links are entirely different for data and voice 
links and the interface is able to determine from the setup procedure how to set 
the selector. 
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Consequently, the output unit is preferably able to deliver output in the 
form of speech. In addition, the output unit 28 may provide output in the form 
of text for those users connected via a device able to display text (including 
output in form of fax and email). In the particular application of an automated 
directory enquiry service, the output unit may additionally provide the user 
with the ability of dialing a telephone number obtained as a result of the 
enquiry. 

The interface 10 is preferably interfaceable to a specific mobile 
telephone data facility such as WAP. WAP allows the user to connect via the 
Internet and carry out a search textually. Furthermore, a WAP interface can 
allow a user to speak his input into the phone and may allow spoken output to 
be given to the user. 

Additionally or alternatively to WAP, the service may be interfaceable 
to an instant messaging service. Thus the data recognition unit 20 preferably 
receives a message initiating a query and, in response, sends a reply comprising 
a first one of a hierarchy of questions appropriate to the data fields to be 
searched. The query formulator 26 receives the user's responses and builds 
them into a query which is submitted to the database 16 as soon as sufficient 
information is judged to be available. 

In applications involving name-based searching, it is widely appreciated 
that individual names do not exist in isolation but rather come with different 
spellings, or in shorter and longer versions. In particular, first names are often 
associated with nicknames and in addition, may appear in a database either 
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spelled out in full or appearing as initials. Often, when considering a database 
such as a telephone directory, a first name being used in the query may appear 
in the database only after a spouse's first name. For example the enquirer may 
be asking about Anne Jones. But Anne Jones may appear in the database as 
Jones A, or as Jones, Michael and Anne. Likewise Michael Jones may appear 
in the database as Jones, M or as Jones, Mike or as Jones, Michael. 

Thus, in one embodiment of the present invention the interface includes 
a names database 30 defining associations, or associative linkage, between 
names, and a search query is built up based on the name given by the user 
together with associated entries in the names database 30. The names database 
preferably comprises associations between entries comprising different ways in 
which a given query name may be expected to appear in the database 16, 
alternative spellings, nicknames etc. and including common mistakes. 

As mentioned above, a particularly preferred embodiment of the present 
invention is an automated directory enquiry service, in which case the database 
16 is a contact directory giving at least one contact point, such as a telephone 
number, for each one of a plurality of searchable entries such as subscribers to 
the telephone company associated with the database 16. 

In such a database, the individual subscriber may have more than one 
contact point listed, perhaps a home telephone number, a fax number, a mobile 
telephone number, an office telephone number, an e-mail address and home/ 
office address. These different types of contact points may be arranged in a 
hierarchy so that an enquirer is always given one of the contact points first 
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unless he specifies one of them in particular. In a preferred embodiment, the 
system uses the retrieved contact point to dial and contact the subscriber at the 
request of the enquirer. In a preferred embodiment the system tries the contact 
points in order, down the hierarchy (or even simultaneously) until the 
subscriber is contacted. In another embodiment, the enquirer may set his own 
hierarchy of contact point types for contacting others and in another 
embodiment the subscriber may set a hierarchy of his own contact points for 
the system to use when attempting to contact him. Furthermore the subscriber 
may set his hierarchy to change during the day. For example, knowing that he 
is generally in the office at certain hours, traveling at certain hours and at home 
at other times, he may choose to set the relevant contact points to be at the top 
of the hierarchy at the respective times. For example he may set his office 
number to be at the top of the hierarchy between 9AM and 5PM and his home 
number to be at the top of the hierarchy between 6PM and 8AM with his 
mobile number at the top of the hierarchy at all other times. 

The setting of the hierarchy is preferably carried out using a 
programming interface, again using voice and text communication as 
appropriate to the user's method of connection. 

In a further embodiment, a contact point, such as a phone number, is 
usable as an input to obtain other details of a subscriber. Preferably, such a 
feature is provided only with the approval of the individual subscriber. 

An automated directory assistance package according to a preferred 
embodiment of the present invention carries out an initial dialogue with the 
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enquirer, for example by asking him firstl for the name of a town or otherwise 
for information that will allow the system to identify one of a plurality of 
databases or database parts to be used in a later search. Once a database, or 
database part, or directory, has been identified, the enquirer may then be asked 
for a family name and a postcode. If sufficient information is now available to 
arrive at a final result then the system will read the resulting number to the 
enquirer and/or ask the enquirer if he would like to be connected automatically 
to the number. If the search does not give a final result, for example there is 
more than one person with the same family name in the same postcode, the 
system will ask the enquirer for further information such as a first name or 
initial. The user may answer the question, or request that the system pass on to 
a further question. Likewise if the answer still does not permit a final result, 
then the enquirer may be asked for a street address. 

If the information given at any stage cannot be resolved into either text 
or phonemes, then the enquirer is preferably asked to spell the information he is 
giving. If the information still cannot be resolved, whether because it cannot 
be effectively converted into text or phonemes, or because all the questions 
have been asked and a single search result has not been arrived at, the enquirer 
is preferably passed to a human operator. 

Preferably, the system does not ask the user questions that are not 
helpful in resolving the search. For example, if, using just a family name, a 
number of search results are obtained that differ in the street address but not in 
the first name or initial, the enquirer is preferably asked only about the street 
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address, and the question about the first name or initial is skipped as being 
superfluous. Likewise, if the entries found differ only by first name then the 
user may be asked as a supplementary question to specify the first name. 
Similarly, if the entries found differ in the spelling of the family name (it will 
be recalled that searches are preferably carried out over all associated spellings 
of the same name) then the user may be asked, as a supplementary question, to 
spell the family name. 

Thus, in most enquiries, the majority of available questions are not 
asked because searching and asking questions are carried out in a single 
integrated process that is aimed at distinguishing between search results and is 
preferably terminated when an unambiguous result is obtained. 

The skilled person will be aware that the directory need not be restricted 
to private subscribers as in the traditional white pages telephone directory. 
The directory may equally well encompass business or commercial directories 
or any other kind of searchable database that may require remote interrogation. 
In all of these cases the directory is treated as a database and operation is 
identical. 

Preferably, if the directory is sufficiently large, (several millionentries) 
such that even uncommon family names are likely to appear several times, the 
street address may be asked as part of the initial information prior to beginning 
the search. On the other hand, if the directory is relatively small, (a few 
hundred thousand entries) then only the name and the postcode is preferably 
asked. 
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In one preferred embodiment, the search results may be sent via the 
output 28 to an enquirer using a mobile telephone messaging feature such as 
SMS. This has the advantage of the user being able to store it in the telephone 
internal memory or in a personal address book. 

Preferably, features are provided allowing the enquirer to be passed on 
to a human operator at any time, to ask for additional numbers, and, where a 
subscriber entry contains several telephone numbers, to try each one in turn in 
accordance with a predefined hierarchy until the subscriber is reached. 
Connection to the subscriber can also be done automatically in a serial or 
parallel way without user intervention as described earlier 

Preferably, automatic transfer to a human operator is also provided 
when a search fails to find any matches or when the line is too noisy to allow 
speech recognition, or for example when the number/subscriber data is 
classified. A case of high line noise may be inferred from a series of low 
confidence matches or from direct measurement of noise on the incoming line. 

In a preferred embodiment, transfer to a human operator includes 
playing back to the human operator the answers that the enquirer has already 
given to the automatic system. 

Automatic connection to the number retrieved may be carried out under 
the supervision of a call center, in which the call center handles issues such as 
failure to complete the call. As an alternative, the number may simply be 
dialed and the enquirer left to himself. 
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The coverage of the enquiry service need not be restricted to any 
geographic region or to the subscribers of any given telephone company. The 
enquiry service is restricted only by the availability of databases. In providing 
databases, it is possible to use a strategy of searching a single large database or 
to use several smaller databases, segmentation of the data being carried out, for 
example, according to geographical information. The data in the database need 
not be restricted to search fields and telephone numbers. E-mail addresses and 
like information may also be included. An individual database may be a 
company-specific database and may thus include information such as 
department or other organizational information. 

In a further preferred embodiment of the present invention, a personal 
address book feature is provided. In this embodiment, entries retrieved for a 
given enquirer are stored in the personal address book for retrieval later on by 
that enquirer, so that he does not need to request the same information again. 
For example, the retrieved entries may be stored in the portable phone 14 as a 
personal address book feature. Storage of the data by the enquirer may be 
automatic, meaning that the system is set up to store the results of any enquiry 
in such a personal address book . Alternatively, storage may be on demand, in 
that the enquirer may explicitly request the data to be stored. As well as the 
dialogue-based interface, the system is also able to use the WAP interface or 
any other telephone-data interface. The enquirer input may be sent as text via 
the WAP interface, or as speech or via SMS, and, as before, the results may be 
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sent as any one of the three (or as fax , email, or any other available format). 
Data sent as text may then be displayed on a screen. 

The system can be used to provide what is known as reverse searching, 
that is, searching using the telephone number or like result field as the search 
field. An application of reverse searching is to identify the senders of incoming 
calls or messages. Thus, instead of displaying a telephone number on the 
screen it is possible to display a sender name. In a particularly preferred 
embodiment, a database of photographs and or video and or audio clips may be 
used and the reverse search facility may locate and display the retrieved 
photograph or video or audio clip or a combination thereof of the retrieved 
sender. In another embodiment, to be described in greater detail below, the 
reverse output of the directory may be inserted as a header of a voicemail 
message, so as to automatically provide voicemail messages with clear 
identification of the sender. 

In a further preferred embodiment, a database of the kind described 
above may be associated with a location system. There are currently available 
location systems which comprise street maps and can be incorporated into 
vehicles to guide a user to a destination. Such systems may include GPS 
operability so that the current location of the user can be compared to a desired 
location so as to give the user a route to the desired location. In the present 
embodiment, the system is combined with a database such that the enquirer 
gives the name or other identification information, the database obtains an 
address and the location system advises on a best route to reach the address. 
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Advice on the best route may be given as spoken instructions or as written data 
or as a path displayed on a map screen as appropriate for the device through 
which the enquiry is made or to which the enquiry result is sent. Preferably, 
the same spoken/written instructions are stored in several languages and the 
user is able to select the desired language. 

Reference is now made to Fig. 2, which is a flow diagram showing in 
greater detail the way in which an incoming user enquiry is processed to 
produce a query for the database and a usable output for the user. In Fig. 2, in 
an initial stage S10 data input is received from the user. The system 
determines whether the incoming data is text or speech in a decision stage SI 2. 
If the data is found to be speech then it is preferably converted to text in a 
conversion stage S14. As discussed above the speech may alternatively or 
additionally be converted into phonemes. The resulting text or phonemes may 
then be formulated into a query in a stage SI 6. The number of fields are then 
used to determine, in determination stage SI 7, whether the query is sufficiently 
strong to obtain a useful result. If not, further data is obtained by asking a 
further question of the enquirer in a stage SI 8. If the data is judged to be 
sufficient then it is used to query the database in stage S20. If the database 
does not give a sufficient outputin determination stage S22, then the procedure 
continues to S18 and asks the user a further question, preferably judiciously 
chosen to enable the system to discriminate between multiple search results. 
Once a sufficient answer is available the system outputs an answer to the user 
in output stage S24. 
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Reference is now made to Fig. 3, which is a flow chart showing in more 
detail the procedure of SI 8 in Fig. 2, namely that of determining a further 
question to ask. The database 16, and likewise the associated query, is 
preferably divided into a series of fields each containing a different part of the 
information that identifies the subscriber, each part being possible to be 
associated with a different question. Preferably, as discussed above, the 
questions are arranged in a hierarchy so that information more critical to the 
search is entered early on during the query formulation process. For example a 
family name field is more critical in searching a directory than say a street 
number field. In addition, natural human usage is taken into account in 
arranging the hierarchy, again, as discussed above. The procedure of stage 18 
determines a further question to ask by noting which fields in the query being 
formulated currently contain information and then determining which is the 
next empty field in the hierarchy. A question associated with the next empty 
field is then selected and asked. For example, the database may comprise the 
following fields in hierarchical order, surname, first initial, second initial, street 
address, zip code. If the surname has already been obtained then the next field 
in the hierarchy is the first initial field. A question asking the user for the first 
initial is thus selected and asked. 

Previously retrieved but incomplete results from the database are also 
used as operands in the procedure if available. In a stage S30 a next empty 
field in the hierarchy is obtained. If current but incomplete results are available 
then it is determined whether filling in the next empty field will assist in being 
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able to complete the search. For example if the next empty field is the first 
name, and the currently retrieved results comprise two candidate results that do 
not differ in the first name field, then the field is skipped and the following 
empty field is proceeded to until a relevant field is found for which asking a 
question is worthwhile. 

A question associated with the relevant empty field is then selected in 
stage S32. For example, if the field is the street address then the question 
selected may be "What is the required street address". In stage S34 the 
question is preferably sent for output to the enquirer. As discussed in detail 
above, the enquiry may be a voice or a text enquiry and the question will thus 
be sent for output in the appropriate format. 

Reference is now made to Fig. 4, which is a flow diagram explaining 
operation in the event of a user response being recognized with a low level of 
confidence. A threshold confidence level is preferably set such that in a stage 
S40 a low confidence recognition can be identified. In such a case a question is 
preferably output requesting the user to spell his response in a stage S42. 

Reference is now made to Fig. 5, which is a block diagram of an 
architecture for supporting the automatic directory assistance system, in 
accordance with a preferred embodiment of the present invention. The block 
diagram is by way of example only and illustrates an embodiment of the 
invention applied specifically to a node of a high speed backbone network, the 
node using El/Tl and SS7 signaling. In Fig. 5, the architecture 40 comprises a 
standard call center 42 fronted by a front end unit 44, preferably a directory 
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assistance telephony and RM server 44 attached directly to the high speed 
backbone network (HSBN) 46 of the telephone network. Also connected to the 
HSBN are a call control signaling (CCS) unit 48, a directory assistance 
interface server 50, a directory access administration server 52, and an 
administrative LAN 54 to which are attached interface servers to SMS 56 and 
to WAP 58 and the subscriber directory 60. An alarm 62 is also attached to the 
administrative LAN 54 and preferably administers all the alarms on the system. 

The CCS unit 48 is optional and preferably handles the SS7 call control 
signaling referred to above. 

The DA telephone and RM server 44 is a front end unit of the directory 
assistance architecture 40. The front end unit interfaces to the telephony 
system and for example stores or caches voice prompts. Additionally it may 
comprise text to speech software to provide voice prompts as necessary. Each 
front end server 44 is associated with at least one, preferably more than one, 
back end unit, here indicated as a DA session and DB unit 64 and the front end 
server 44 preferably carries out resource management tasks therefor. The DA 
interface server 50 operates the WAP interface 58, SMS interface 56 and any 
other interfaces to external systems that may be included. The DA telephone 
and RM server 44 preferably takes the results of the directory assistance 
session to complete the call, that is to say make the connection, for the 
enquirer. 
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DA session and DB unit 64 preferably carries out handling of the 
directory assistant dialog session and the database search session as explained 
above in respect of Fig. 1 . 

The DA interface server 50 preferably serves to send SMS messages via 
TCP/IP to the SMS server 56 and WAP messages to the WAP server 58. 

The DA administration unit 52 is a supervisory and programming unit, 
preferably used to set up, manage and reprogram the system. For example, 
new dialog would be entered via DA administration unit 52. 

It is appreciated that certain features of the invention, which are, for 
clarity, described in the context of separate embodiments, may also be provided 
in combination in a single embodiment. Conversely, various features of the 
invention which are, for brevity, described in the context of a single 
embodiment, may also be provided separately or in any suitable 
subcombination. 

It will be appreciated by persons skilled in the art that the present 
invention is not limited to what has been particularly shown and described 
hereinabove. Rather the scope of the present invention is defined by the 
appended claims and includes both combinations and subcombinations of the 
various features described hereinabove as well as variations and modifications 
thereof which would occur to persons skilled in the art upon reading the 
foregoing description. 



